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How rare are diffusive rare events? 
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Abstract. - We study the time until first occurrence, the first-passage time, of rare density 
fluctuations in diffusive systems. We approach the problem using a model consisting of many 
independent random walkers on a lattice. The existence of spatial correlations makes this problem 
analytically intractable. However, for a mean-fleld approximation in which the walkers can jump 
anywhere in the system, we obtain a simple asymptotic form for the mean first-passage time 
to have a given number k of particles at a distinguished site. We show numerically, and argue 
heuristically, that for large enough k, the mean-field results give a good approximation for first- 
passage times for systems with nearest-neighbour dynamics, especially for two and higher spatial 
dimensions. Finally, we show how the results change when density fluctuations anywhere in the 
system, rather than at a specific distinguished site, are considered. 



Introduction. — Rare events control the kinetics of 
many physical systems. They are frequently associated 
with activated processes, corresponding to the crossing of 
a high free-energy barrier, as happens, for example, in 
nucleation processes [1]. On the other hand, in diffusive 
systems, where particles undergo Brownian motion, there 
are no such energy barriers. Here, questions arise such 
as: what is the time required for a particle to first reach 
a particular region of space, or for a pair of particles to 
meet for the first time? Depending on the circumstances, 
such events can be very rare, in which case they may be 
the limiting step in the dynamics of the system. It is then 
crucial to understand when such events will occur. 

Diffusive systems exhibit universality, in the sense that 
their behaviour is often independent of the microscopic 
details. It is then convenient to study simple models, in 
the hope that the results will extend to more complicated 
systems. A common choice is that of particles undergo- 
ing a random walk on a lattice [2]. For a single-particle 
random walk, first-passage times to a given site in an in- 
finite system have long been studied [3] , and much is now 
known about their statistics [2,4]. Recent progress in this 
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direction was made by Condamin et at, who studied first- 
passage times and distributions for a single particle diffus- 
ing from a source site to a destination site in a confined 
region [5-7]. Other properties, however, are relevant only 
when many walkers are present, for example the sequence 
of times for each to arrive at a given point [8], and the 
territory covered after a given number of steps [9, 10]. 

In this Letter, we study the following first-passage prob- 
lem for systems containing many random walkers on a pe- 
riodic lattice: how long does it take for k of the walkers to 
accumulate at a given site? For k much larger (or much 
smaller) than the mean density, this corresponds to a large 
local density fluctuation, and is thus a rare event. A re- 
lated quantity was studied numerically in [11], as a model 
of particles traversing a membrane pore. Special cases 
have also been studied in relation to Ritort's backgam- 
mon model [12,13], and another related model was recently 
used in a study of cooperativity in chemical kinetics [14]. 
The general problem has, however, received scant atten- 
tion, despite its relevance for many physical systems. 

Indeed, the problem pertains to any systems whose dy- 
namics may be affected by the accumulation of diffusing 
particles in a given region. A particular physical exam- 
ple that we have in mind is a fluid of hard discs exhibiting 
glassy behaviour. When the density of discs is close to the 
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fluid-solid transition, configurations become "jammed" , 
resulting in slow, glass-like dynamics [15]. However, even 
in this situation, each disk can be surrounded by a small 
amount of "free volume" in which the disks "rattle" about. 
This limited disk motion gives rise to an effective free vol- 
ume transport, which, on large scales, may be viewed as 
diffusive. When enough free volume accumulates in a re- 
gion of the system, a neighbouring disc can move into the 
resulting space, causing a large local rearrangement of the 
jammed structure. The dynamics of these rearrangements 
is thus controlled by the rare accumulation events, which 
can be viewed as density fluctuations of the free volume. 

In this work, we focus on the simple multiple random 
walker problem, which still poses rather formidable chal- 
lenges. Actually, for systems containing many random 
walkers, exact results are known for first-passage times 
from one configuration to another, in terms of eigenval- 
ues and eigenvectors of the transition matrix, for the case 
where a single walker moves at each time step [16, 17] or 
where all walkers can move simultaneously [18]. But these 
results do not extend easily to the case of interest here, 
where we are concerned with transitions from a large set 
of rather disparate configurations {i.e. all configurations 
having k walkers at a given site) to another. 

Since diffusive processes give rise to spatial and tempo- 
ral correlations that render the problem analytically in- 
tractable, we also consider a mean-field version of this 
problem, in which particles can jump not only between 
neighbouring sites, but to any of the sites in the system 
with equal probability, which reduces the problem to a 
type of Ehrenfest urn model [13,19]. We show numeri- 
cally that this approximation provides a qualitative ex- 
planation of the behaviour of the mean first-passage time 
also for diffusive dynamics. 

The model. — We consider N independent random 
walkers on a lattice with volume (=number of sites) V and 
periodic boundary conditions. In this paper we study only 
regular "cubic" lattices of spatial dimension d (a ring for 
d = 1, square lattice for d = 2 and simple cubic lattice 
for d = 3). In ID, at each time step a single walker is 
selected to move right, move left, or stay where it is, with 
probabilities p, q, and r = 1 — (p -I- g), respectively. For 
2D square and 3D simple cubic lattices, we study only the 
case in which the chosen walker moves one lattice spacing 
in any direction with equal probability. 

We are interested in the event {i.e. the set of configura- 
tions) Sk that a distinguished site, labelled by 0, is occu- 
pied by exactly k particles; in particular, we are interested 
in the first time at which this event occurs, that is, when 
the system configuration lies inside the set Sk- The mean 
particle density (number of particles per site) p :— N/V 
plays a crucial role here: for k either much larger or much 
smaller than this mean density, the event Sk is rare. From 



the static point of view, this means that the probability 
that the event occurs is small. From the dynamical point 
of view of interest here, it means that the time needed to 
actually see the event occur is, on average, very long. 

Starting from an initial condition with the particles dis- 
tributed randomly, the main question of interest is thus 
how long it takes for the rare event to occur - that is, we 
are interested in the mean first-passage time t^^ for the 
system to reach Sk, averaged over all random initial con- 
ditions which are not in Sk [6]. In this paper we consider 
the case of relatively low mean density, of order p — 1, 
and fluctuations to large local densities k ^ p. In sys- 
tems where the density is high, a fluctuation in which a 
site becomes nearly empty is also a rare event with impor- 
tant physical applications, such as in Ritort's backgam- 
mon model [12]. This case can also be studied using the 
methods described here, as we will discuss elsewhere. 

Mean-field approximation. — In order to make an- 
alytical progress, we consider a mean-field version of the 
model, in which the walkers may jump to any other site, 
with equal probability 1/{V — 1); this is equivalent to con- 
sidering dynamics on a completely-connected graph. The 
model then reduces to an Ehrenfest urn model [13]. 

In this case, the particles which are not at the distin- 
guished site '0' can be regarded as belonging to a reser- 
voir containing N — uq particles, the locations of which 
are irrelevant. The occupation number uq of site '0' then 
undergoes a Markovian random walk whose hopping prob- 
abilities depend on its current position, for which exact 
results are known [20,21]. In the context of urn models, 
further results were found in [19,22]. We approach the 
problem from within this context. 

We denote the mean first-passage time from Si to Sm 
by Ti^m- It is convenient to first study t^ := Tk^k+i, the 
mean first-passage time from Sk to S'^+i. Suppose that at 
a given time step, there are k particles at the distinguished 
site, so that the system is in Sk. There are then three pos- 
sibilities, whose probabilities depend explicitly on k: with 
probability Pk, a walker from the reservoir lands at the 
distinguished site, thereby reaching Sk+i', with probability 
7fe, a walker from the distinguished site leaves to the reser- 
voir, giving Sk-i', and with probability ak :~ 1 — /3fe — 7a;, 
a walker moves around inside the reservoir, without af- 
fecting site 0, so that the system remains in Sk- In the 
mean-field case, these probabilities are independent of the 
microscopic configuration, and are given by 

A = ^^(l-r); 7.-^(l-r). (1) 

Here, k/N is the probability of choosing a site containing 
k of the N walkers. The factor (1 — r) appears in all 
such probabilities, and results in a factor 1/(1 — r) in the 
expressions for mean times; for simplicity, we omit it in 
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the following, considering the case where the particle must 
jump, i.e. p + q = 1. 

After a given step, which takes time 1, if the system 
finds itself in some Si, then the subsequent mean first- 
passage time to Sk+i is Ti^k+i- By conditioning on the 
outcome of a step starting at Sk , we obtain 

T,t = afc(l + T+)+/3fc(l + rfc+i^fe+i)+7fe(l + Tfe-i^fc+i)- 

(2) 
Since only one walker moves at each step, to reach Sm 
from Si, we must first pass through all intermediate 
Si. We thus have additivity of mean first-passage times: 
Ti^m — "^iLi ■''i for I < m. In particular, to reach Sk+i 
from Sk-i, we must pass through Sk, so that Tk-i^k+i = 
rt ■ Furthermore, we have Tm^m = for any m. 
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Inserting these results in eq. (2), we get 
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akT^ 



-lk{T^- 



(3) 



and hence, by rearranging, we obtain a first-order recur- 
rence relation giving r'j^ in terms of t^j^: 






(4) 



Asymptotics. — The recurrence relations given above 
yield closed-form results for mean first-passage times, ex- 
pressed as sums [21]. That representation does not, how- 
ever, allow us to easily understand the behaviour as a 
function of the system parameters. Instead, we focus on 
asymptotic results which give the dominant behaviour. To 
do so, we use the Kac recurrence theorem [6,23,24]. This 
states that the mean recurrence time to a set A in an er- 
godic system is given exactly by rj^"^ — por , where P [A\ 
is the stationary probability that the system is in A. By 
using the results of the previous section, this will then 
provide information on mean first-passage times. We re- 
mark that this result of Kac is apparently still not well 
known in the physics literature: for example, refs. [16-18] 
resorted to much more complicated techniques to calcu- 
late Poincare cycles {i.e. mean recurrence times) in these 
systems; see also [5,6]. 

We denote the set of microscopic configurations by Q.. 
These are given by vectors of size N containing the po- 
sitions of all walkers. Each such configuration is equally 
likely, since the transition probabilities between them are 
symmetric, so that the Kac formula gives the exact mean 
recurrence time to the set Sk as 



this result was first derived in [20] by a different method. 
The boundary value t^ is the mean of a geometric dis- 
tribution with parameter ag, and is hence given by Tq = 
^^^ = -g-. By interchanging f3k and 7^, we find a simi- 
lar recurrence relation in the other direction for the mean 
time Tf^ := Tk^k-i, obtaining Tf^ = (1 + (3kTi7+i)/lk- 
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where |^| denotes the number of configurations in the (fi- 
nite) set A, obtained by combinatorial arguments, and ( j,) 
is a binomial coefficient. 



An important related concept in random processes is the 
recurrence time to a set A, that is, the time required for 
the system to return to A, given that it started there [23] . 
The mean recurrence time r^'"^ to the set Sk is found, again 
by conditioning, to satisfy 
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For large N and V, fixed density p = N/V, and fixed 
k -^ N, we substitute the asymptotic results Nl/{N — 
k)l ^ N'^ and 1 — y ^ exp(— l/V) into the exact recur- 
rence time expression (8), to obtain the asymptotics 
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In the situation that we are considering, for fluctuations 
to fc 3> p, the mean times t^ to decrease k are small, 
whereas the times t'^ to increase it are large. We thus 
obtain the following approximate relation between the re- 
currence time and the first-passage time: 
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(6) 



and hence 

rr^fc!p-'=exp(p). (10) 

Physically, the total number of particles present should 
not affect the dynamics of a given particle, so that the 
physical time is the number of steps per particle (i.e. the 
physical time unit corresponds to one "sweep" through the 
N particles in the system). Since t^^ ~ T^fc""^' ^® finally 
obtain the following approximate asymptotic expression 
for the per-particle first-passage time to Sk'. 



The mean first-passage time to the set Sk from a random 
initial condition, denoted t^^ , is now given by a sum of 
the Tjn^k, each weighted by the probability of starting in 
Sm- This is dominated by the largest term, ro^fc, which in 
turn is dominated by t^-^, giving the approximate result 
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Asymptotically, the mean-field first-passage time thus de- 
pends on N and V only via the density p of walkers for k 
small compared to the total number of particles. 
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(Y) Figure 1 compares the exact and asymptotic mean-field 
first-passage times per particle (from initial occupation 
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number to final occupation number k) for several system 
sizes at fixed density. The times are indeed independent of 
system size, provided that k is not close to N. The curva- 
ture visible in the plot shows that the growth is faster than 
exponential: indeed, asymptotically log(fc!) ~ fclogfc — fc, 
giving a logarithmic correction on a semi- log plot. For 
larger values of fc, a different approach is needed; however, 
Arora et al. [19] showed that for k = N the asymptotic 
result is t^^ ~ V^ , which coincides with the asymptotic 
behavior of our result when we put V — N/p. 
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Fig. 1: Exact results for mean first-passage times ro^fe per 
particle in mean field for different system sizes V, compared to 
the asymptotic result (11), as a function of the size k of the 
fluctuations. The mean density is fixed at p = 1. 

Comparison between mean-field and spatial 
cases. — The relation between the mean first-passage 
time Tit_i and the mean recurrence time TJi°'^ was derived 
in the mean-field approximation. Nonetheless, the follow- 
ing argument shows that it should also provide a reason- 
able approximation in the case of diffusive dynamics. If 
we start in Sk for a large fc, then it is unlikely that we will 
add another particle to the distinguished site. Rather, af- 
ter a waiting time, we will likely drop down to S^-i, and 
from there most likely to 5'fe_2, and so on - that is, we can 
assume that once we have left Sk, we escape from there 
back to a "random" condition, from where the system will 
reach Sk+i via a first-passage process. Thus, conditioning 
on whether we stay in, or leave, Sk, we obtain 



(1 X P [stay]) + (rfP x P [leave]) (12) 

l+lk{rr-l). (13) 



Neglecting the fast escape process, we obtain the same 
approximate relation t^^ ~ '''k"^ Ilk as before, so that the 
mean-field result should also describe qualitatively the be- 
haviour in the spatial case. This also explains why higher 
dimensions are closer to mean-field: once the system has 
left Sk, it is much harder to return to it, since even if we 



choose a particle in a neighbouring site, it has many more 
ways not to return (probability 1 — {l/2d)) than to return 
(probability \/2d) to the distinguished site. 

Figure 2 compares the mean-field results to unbiased 
nearest-neighbour single-particle motion on "cubic" lat- 
tices with dimensions d = 1,2,3 and a fixed number of 
lattice sites. We see that the mean-field results reproduce 
very well the qualitative dependence of the first-passage 
time on fc. Note that due to the very rapid increase in 
the first-passage time as a function of fc, we are unable to 
reach large values of fc with brute- force simulations^. We 
also show sample results for a higher density and smaller 
system size, for which larger values of fc can be reached. 

For a quantitative comparison, the ratio of the diffusive 
to mean-field results is shown in fig. 3. As expected, mean 
field is closer to the spatial results for higher spatial di- 
mension. Surprisingly, for d = 2 spatial dimensions the 
mean-field result is also a quantitatively good estimate, 
provided the event is actually rare, i.e. for moderate fc. 
Thus spatial correlation effects are very strong in ID, but 
much less so in higher dimensions, in agreement with the 
above discussion. 

Furthermore, in all cases, the mean-field results become 
increasingly accurate for larger fc. This can be explained 
as follows. For small fc, in the mean-field case, particles 
can arrive rapidly at the distinguished site from any part 
of the system and quickly cause the required fluctuation. 
In the diffusive case, however, the arrival of particles at the 
site is controlled by slow diffusive processes, for which the 
distance a particle is able to move in a time t scales only 
like \/t. For large fc, on the other hand, the first-passage 
event takes so long that each particle has enough time 
to explore the whole (finite) system by diffusion, and the 
problem is reasonably independent of the spatial details. 

This independence can also be explained in terms of 
a recent result due to Condamin et al. [7]. Our many- 
walker problem can be mapped onto the motion of a single 
walker in an (7V(i)-dimensional hypercubic domain with an 
appropriate target set. The result of ref. [7] shows that the 
first-passage time for the applicable case of "non-compact" 
exploration of the domain should depend rather weakly 
on the distance to the target, and hence on whether the 
dynamics is diffusive or mean-field. We intend to explore 
this approach in future work. 

Effect of bias. — To further study the relation be- 
tween the spatial and mean-field cases, we studied in ID 
the effect of the bias parameter p, which is the proba- 



^We have explored the possibility of using a more efficient algo- 
rithm along the lines of partial-path sampling [25] to evaluate these 
large first-passage times. Although these methods work very well for 
the mean-field case, for diffusive dynamics the strong spatial corre- 
lations present makes their implementation a difficult task. 
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Fig. 2: Mean first-passage times t^^ as a function of k for 
cubic lattices of spatial dimension d = f,2, 3, for system size 
V = 729 = 27 =9 and density p = f, compared to the 
mean-field result. Data are also shown for a ID diffusive chain 
with V = 40, p = 2, and the corresponding mean-field result. 
Error bars are smaller than the size of the symbols, and lines 
are shown as a guide for the eye. 



bility that a particle jumps to the right, always fixing 
p + q = 1, so that the chosen particle always jumps. Fig- 
ure 4 shows the result of varying p between 0.5 (symmetric 
case with no bias) and p = 1 (totally asymmetric motion 
to the right). As p varies between these two extremes, 
the first-passage time curve interpolates smoothly between 
the symmetric and mean-field cases, with the numerical 
data falling on top of the mean- field curve for p = 1. We 
can understand this as reflecting the decreasing influence 
of spatial correlations. When p = 1, spatial correlations 
propagate to the right only, so that the incoming particles 
to the distinguished site have, in effect, no information 
about these spatial correlations, just as in mean fleld. 

Fluctuations anywhere in the system. Finally, 
we study the problem of the flrst-passagc time f^^ for any 
site in the system to reach occupation number k. Even in 
mean field, exact calculations for this case are very diffi- 
cult: we are aware of solutions only ioiV = 2 sites [22] and 
for the special case in which k — N [19], i.e. in which all 
particles collect at one site, solved using a clever method 
which unfortunately does not appear to generalise to other 
values of k. Solving the case fc = would yield an exact so- 
lution for zero-temperature glassy relaxation in the Ritort 
backgammon model [12, 19]. 

Instead of an exact solution, we can relate this case 
to that with a distinguished site. The set of configura- 
tions Sk forming the rare event is now approximately V 
times larger than before, since configurations with k par- 
ticles on any of the V sites are included in the event. The 
Kac method then shows that recurrence, and hence first- 
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Fig. 3: The diffusive data of fig. 2, divided by the corresponding 
mean-field first-passage times for the same system parameters. 



passage, times are approximately V times smaller than be- 
fore. Alternatively, we can argue that the sites are roughly 
independent, with exponentially distributed first-passage 
times. The minimum of these times - the time until the 
first site reaches occupation number k - is then also expo- 
nentially distributed, with a mean V times smaller. 

To test this, we plot in fig. 5 the first-passage time to 
reach occupation number k = 6, 7, 8 somewhere in the 
system. In this figure, the times have not been divided by 
N as in the previous graphs, and hence they correspond 
to pV times the physical mean first-passage time to the 
event. Since in the distinguished case, for a fixed density, 
the times per particle converge to a constant as V ^> oo 
(as seen, for example, in fig. 4), the above argument would 
imply that the raw total number of steps should converge 
to a constant in the current case. In fact, we see that the 
numerically-determined times decay slightly faster than 
the 1/V expectation, apparently by a small inverse power 
of V , the exponent of which is roughly independent of the 
value of k. (For k — 6 the times are very short, and the 
apparent saturation is not relevant.) 

This observation can be explained as follows. As the 
system size T^ — > oo, configurations with k particles on 
some site are much more likely to occur. Each such con- 
figuration contributes to the calculation of the mean 
first-passage time. However, the probability of lying out- 
side the set Sk of such configurations decays exponentially 
with V. Including them in the calculation thus results in 
an exponential decay of the mean first-passage time with 
V, which merely measures the volume of Sk rather than 
the dynamics of the system, and for this reason we disallow 
such initial conditions. Nevertheless, as V grows, there is 
still an ever greater probability that some sites have occu- 
pation numbers close to k. These sites can more easily be 
reached by other particles, thus lowering the mean first- 
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Fig. 4: Mean first-passage times per particle to occupation 
number A: = 6 at a distinguished site, on a ID lattice as a 
function of system size V, for different values (labelled) of the 
bias parameter p, with density p — 1. The mean-field (MF) 
result is shown for comparison (black dots). 



passage time from its expected 1/V behaviour. 
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Fig. 5: First-passage times t^^ , as a function of system size V , 
to reach occupation number fc = 6, 7, 8 anywhere in the system, 
starting from random configurations where no sites have this 
occupation number. Times shown are numbers of steps. 

Conclusions. — In summary, we have shown how the 
mean first-passage time to rare events in a many-particle 
random walk model depends on the system parameters. 
We found that the first-passage time to have k particles 
at a distinguished site grows asymptotically as k\/ p^ for 
the mean-field case, and we showed that this agrees in- 
creasingly well with the spatial results for increasingly rare 
events. We also showed how the results change for fluctu- 
ations anywhere in the system. 

In the future, we intend to study the full probability 
distribution of first-passage times. A further interesting 



extension would be to study the statistics of first passage 
to large density fiuctuations for diffusive processes on het- 
erogeneous - e.g. scale-free - network structures. 
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